Fuzzy Induction in Dynamic User Profiling for Information Filtering
نویسندگان
چکیده
In this paper we investigate the role of the user profile in information filtering and we introduce a novel algorithm for learning the user profile based on user’s initial profile and on a queries’ interpretation using fuzzy generalization (Angryk and Petry 2003). Thousands of documents are usually retrieved by search engines for a given query during an information search on WWW. One way to prune irrelevant documents is to take advantage of the user’s implicit interests to filter the documents returned by the search engine, or to reformulate the query based on these interests. One of the common representations of the documents (and queries) in information retrieval is based on the vector hyperspace model (Salton and McGill 1993). We are using the expanded version of Salton’s vector space model introduced by Barbu and Simina (2003) for its effectiveness in computing the dynamics of the user profile. In contrast with the classical vector space model, this recent model, time-words vector hyperspace, has an additional temporal dimension. The coordinates of the documents and queries vectors are calculated using the traditional TF-IDF technique. Only the queries have a temporal dimension (current interest weight) which is set to a preset positive initial value that decays in time, suggesting that some specific user interests could decrease as time goes on. The user’s categories of interest are computed based on a novel approach using Fuzzy Concept Hierarchies (FCH) extracted from the WordNet ontology. An FCH is built for each of the query’s keywords, using their hypernym chains provided by WordNet. A subunitary membership degree (weight) is assigned for each of the edges of the hierarchy via a bottom-up approach, from the lowest level concept (initial keyword) to the more abstract one. The assignment of the weights is performed according to the following two conditions: 1) The sum of weights assigned to the links outgoing from the original keyword has to be equal to unity.
منابع مشابه
A NOVEL FUZZY-BASED SIMILARITY MEASURE FOR COLLABORATIVE FILTERING TO ALLEVIATE THE SPARSITY PROBLEM
Memory-based collaborative filtering is the most popular approach to build recommender systems. Despite its success in many applications, it still suffers from several major limitations, including data sparsity. Sparse data affect the quality of the user similarity measurement and consequently the quality of the recommender system. In this paper, we propose a novel user similarity measure based...
متن کاملA New Similarity Measure Based on Item Proximity and Closeness for Collaborative Filtering Recommendation
Recommender systems utilize information retrieval and machine learning techniques for filtering information and can predict whether a user would like an unseen item. User similarity measurement plays an important role in collaborative filtering based recommender systems. In order to improve accuracy of traditional user based collaborative filtering techniques under new user cold-start problem a...
متن کاملیک سامانه توصیهگر ترکیبی با استفاده از اعتماد و خوشهبندی دوجهته بهمنظور افزایش کارایی پالایشگروهی
In the present era, the amount of information grows exponentially. So, finding the required information among the mass of information has become a major challenge. The success of e-commerce systems and online business transactions depend greatly on the effective design of products recommender mechanism. Providing high quality recommendations is important for e-commerce systems to assist users i...
متن کاملA Novel Trust Computation Method Based on User Ratings to Improve the Recommendation
Today, the trust has turned into one of the most beneficial solutions to improve recommender systems, especially in the collaborative filtering method. However, trust statements suffer from a number of shortcomings, including the trust statements sparsity, users' inability to express explicit trust for other users in most of the existing applications, etc. Thus to overcome these problems, this ...
متن کاملA Hybrid Approach for User Profiling
With the growing amount of information being digitized, users find it difficult to obtain the most relevant information that is hidden in the deluge of information returned to them by search engines. In this paper, we describe a conceptual model using a hybrid approach based on user profiling approaches and information filtering techniques that can alleviate the problem of information overload....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004